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Abstract 

Since the celebrated work of Jerrum, Sinclair, and Vigoda, we have known that the perma- 
nent of a {0, 1} matrix can be approximated in randomized polynomial time by using a rapidly 
mixing Markov chain to sample perfect matchings of a bipartite graph. A separate strand of the 
literature has pursued the possibility of an alternate, algebraic polynomial-time approximation 
scheme. These schemes work by replacing each 1 with a random element of an algebra A, and 
considering the determinant of the resulting matrix. 

In the case where A is noncommutative, this determinant can be defined in several ways. 
We show that for estimators based on the conventional determinant, the critical ratio of the 
second moment to the square of the first — and therefore the number of trials we need to obtain 
a good estimate of the permanent — is (1 -I- 0{l/d))" when A is the algebra of d x d matrices. 
These results can be extended to group algebras, and semi-simple algebras in general. 

We also study the symmetrized determinant of Barvinok, showing that the resulting esti- 
mator has small variance when d is large enough. However, if d is constant — the only case in 
which an efficient algorithm is known — we show that the critical ratio exceeds 2^ jrf^'^^ . Thus 
our results do not provide a new polynomial-time approximation scheme for the permanent. In- 
deed, they suggest that the algebraic approach to approximating the permanent faces significant 
obstacles. 

We obtain these results using diagrammatic techniques in which we express matrix products 
as contractions of tensor products. When these matrices are random, in either the Haar measure 
or the Gaussian measure, we can evaluate the trace of these products in terms of the cycle 
structure of a suitably random permutation. In the symmetrized case, our estimates are then 
derived by a connection with the character theory of the symmetric group. 

1 Introduction 

The 'permanent of an n x n matrix A is perm^ = X^j^eS IlILi ^i,7ri) where Sn denotes the group of 
permutations of n objects. If Aij G {0, 1} for all i, j, we can also write perm^ = |{7r E 5^ | tt h 
where vr h A denotes the following relation, 

vr h yl 44> Aj^TTi = 1 for ah i . 

Computing the permanent of a {0, 1} matrix is #P-complete. Therefore, we cannot expect to 
compute it efficiently without startling complexity-theoretic consequences, including the collapse 
of the polynomial hierarchy |Val791 ITod91] . 
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On the other hand, Godsil and Gutman |GG81] pointed out the following charming fact. If 
we define the matrix- valued random variable M so that Mij = pijAij, where the pij are chosen 
independently and uniformly from {±1}, and define X = (det M)^, then it is easy to check that X 
is an estimator for perm^, which is to say that K[X] = perm A. Since det M can be computed effi- 
ciently, so can X. This suggests a natural randomized approximation algorithm for the permanent: 
average a family of independent samples of X. 

The quality of this approximation can be controlled by determining the variance of X. If Xt 
denotes the average of X over t independent trials, then Chebyshev's inequality shows that, in 
order for Xt to yield an approximation of K[X] within a factor a = 0(1) with probability ^2(1), the 
number of trials we need is at most 

VarX E[X2] 

t ~ — p TTT < 



E[X]2 - E[X]2 ■ 

Following |CRS03j . we refer to this quantity as the critical ratio of the estimator. Karmarkar, Karp, 
Lipton, Lovasz, and Luby |KKL^93 showed, unfortunately, that the critical ratio is 3"/2 in the 



worst case, ignoring poly(n) factors. Then again, they showed that we can decrease this to 2"/^ by 
drawing pij uniformly from the unit circle in the complex plane, or simply from the cube roots of 
unity, instead of {±1}. Barvinok |Bar99j obtained a more concentrated estimator by drawing pij 
from normal distributions over M, C, and the quaternions H. 

This raises the interesting possibility that, by choosing the pij from the right set of algebraic 
objects, we might be able to reduce the critical ratio to e''^"'), or even to poly(n), resulting in a 
subexponential or polynomial-time algorithm. One exciting result in this direction is due to Chien, 
Rasmussen, and Sinclair |CRS03] . who showed that certain determinants defined over the Clifford 
algebra CL^ with k generators give estimators where the critical ratio is (1 -|- 0(2~^/2^^"/2_ jj^ ^^j^g 
case of the quaternion group, where A: = 3, they gave a polynomial-time algorithm for a type of 
determinant where the critical ratio grows as (3/2)"/2_ This is currently the best known critical 
ratio for an algebraic estimator which can be computed efficiently. Sadly, however, for larger k we 
do not know how to compute these determinants in polynomial time. 

These results can be given a uniform presentation by defining a notion of determinant for a 
matrix M over an associative algebra A. The traditional Cayley determinant is then 

n 

detM= ^(-l)"J]M,,„i, (1) 

a&Sn i=l 

where (—1)" denotes the sign of the permutation a. Note that detM takes values in A. If A is 
noncommutative, however, the determinant as defined in ([T]) may depend on the order in which the 
product is taken. As written, each traversal Mi^ai is ordered from the top row to the bottom row; we 
could just as easily order them from the left column to the right. This introduces some arbitrariness 
to the definition, and appears to complicate the problem of computing such determinants, even when 
the algebra A has small dimension [ Nis91| . 

One natural remedy is to remove this order dependence by forcibly symmetrizing each product 
appearmg m ^. This gives the following symmetrized determinant, 

sdetM = - (-ir'""n^-."'- (2) 
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Observe that sdet is obtained by symmetrizing each product appearing in ([T|. This definition is 
due to Barvinok |BarOO] . who showed that if A has dimension m, the symmetrized determinant 
can be computed in time 0(n"^+*^(^)). In contrast, no efficient algorithm is currently known for the 
unsymmetrized Cayley determinant ([T]), even when the dimension of A is constant. 

We focus on the algebra Ad, consisting of all d x d matrices over C. We remark that any finite 
dimensional C*-algebraJ^ which appear to be the natural settings for such approximations, are 
semi-simple, meaning that they can be decomposed as a direct product of algebras of the form Ad- 
In particular, all group algebras and the Clifford algebras studied in [CRS03J have this property. 
It follows that many of our results, especially lower bounds on the critical ratio, carry over easily 
to estimators based on suitable distributions in semisimple algebras. 

Now, given a matrix A with entries in {0, 1}, define Mij = pijAij, where the pij are indepen- 
dently random d x d matrices. (We focus on {0, 1} matrices, but we can let the Aij be arbitrary 
nonnegative reals by taking Mjj = pijy^ Aij.) Since the Mij take values in Ad, then so do det M 
and sdet M. There are several ways to turn these matrix- valued determinants into real-valued 
estimators for the permanent of a real-valued matrix A. As mentioned above, most of the existing 
literature has focused on the Frobenius norm of these determinants. For technical reasons, we focus 
first on the absolute value squared of their trace. This gives us two estimators, 

X = |trdetM|^ and = |tr sdet . 

Note that these are random R- valued variables depending on the pij. We will then address the 
Frobenius estimators, 

1 1 2 1 1 2 

-'^Frob = ||detM|| and Xprob.s = ||sdetM|| . 

As an additional degree of freedom, we can draw pij according to two different distributions on Ad- 
The Haar measure is the uniform distribution over unitary matrices. In the Gaussian measure, 
each entry of pij is drawn independently from the Gaussian distribution on C with mean and 
variance 1/d: that is, its real and imaginary parts are drawn independently from the Gaussian 
distribution on M with mean and variance l/{2d), p{x) = /y^- 
Our main contribution is given by the following theorems. 

Theorem 1. For both the Haar and Gaussian measures, in the unsymmetrized case we have 



1 + - . (3) 



In the symmetrized case, 



and more generally. 



nX^] ^^2n^-d+0{l) ,fd=0{l) (4) 



nxsY 
ml] 



Ofe^"'/^) . (5) 



nxs? 

Additionally, we establish lower bounds on the critical ratio E[X|]/ E[X5]^. 

C*-algebra is an algebra over R or C possessing a norm || ■ || and an involution operator •* consistent in the 
sense that ||a;*a;||^ — \\x\\^ . See, e.g., [ConOOl §1] for a complete definition. 
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Theorem 2. Let A he the n x n identity matrix and d a constant. Then 



when the pij are distributed according to the Gaussian or Haar measure respectively. 

Finally, we show the critical ratio differs by at most for the Frobenius estimators than for 
those given by the square of the trace: 

Theorem 3. 

d4 E[X]2 - E[XFrob]2 - E[X]2 ' 

and similarly for Xpi-objS- 

These results give a somewhat frustrating outlook. The critical ratio for the unsymmetrized 
estimator behaves very well, becoming more and more mildly exponential as d increases, much like 
the Clifford group estimator of [CRSQ3.J . However, we do not know how to compute these estimators 
efficiently. On the other hand, we can compute the symmetrized estimator if d is constant |BarOO] . 
but our results show that its critical ratio does not decrease appreciably until d is roughly v?. 

Barvinok [BarOOj suggested that the estimators Xpi-ob.s rnight become asymptotically concen- 
trated when d is large, but constant. Specifically, he made the following conjecture (where we 
have weakened the lower bound, specialized to {0, 1} matrices, and changed the notation to fit our 
purposes): 

Conjecture 1. If A is an n x n matrix with entries in {0,1}, let M{A) he the matrix Mij = 
PijAij, where each pij is chosen independently from the Gaussian distribution on Ad. Define M(l) 
similarly, where Mij = pijSij. Then there is a sequence of constants ^4, where limf^^oo 7d = 1; such 
that for any e > 0, 

|2 



lim Pr 

Jl— >oo 



{id + e^permA < ||^|^|||^ < (7d + e)"" perm A 



1 



Our results do not address Conjecture [T] directly. However, given Chebyshev's inequality, it is 
very natural to consider the following stronger conjecture, which would imply Conjecture [T] 

Conjecture 2. There is a sequence of constants 9d, where \\m.d-,oo ^ = 1; such that for any n x n 
matrix A, the critical ratio of the estimator X-pj-^i^^s = ||M(^)||^ obeys 



< 



Sadly, Theorems[2]and[3]imply that Conjecture|2]is false. It is still conceivable that Conjecture[T] 
is true, but it seems that any proof of it would have to bound higher moments of the estimator: 
the first and second moments alone do not scale in a way that gives concentration. 

The remainder of the paper is organized as follows. In Section [2| we calculate the expectations 
of these estimators, showing that they are each a constant times the permanent, and computing 
the constant explicitly using a diagrammatic technique. In Sections |3] and |4] we bound their second 
moments using the same technique, proving Theorems [T] and |2j In Section |5] we relate the critical 
ratio for the Frobenius estimators to the trace-squared estimators, proving Theorem |3j Finally, in 
Section |6] we discuss the implications of this theorem, and the remaining barriers to an algebraic 
approximation scheme for the permanent. 
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2 The expectation 



Before we proceed, we write the following expansions for these estimators, which we will find useful 
for calculating their expectations and second moments: 



a,(5\-A \ i / \ i / 



(7) 

Pj3i,\l3i ■ (8) 



Since IE[/Ojj] = 0, any term in which some pij appears only once will have zero expectation. 
Then the cross-terms in the expansion ([T]) are zero in expectation except when q = /3, so 



QhA \ i / \ i / a\-A 



E 



Pi,ai 



2 



permj4. (9) 



Here we used the use the following fact, which is an easy exercise: if a is the product of n indepen- 

2 

dent random matrices, chosen from the Haar measure or the Gaussian measure, then E |tr cr| = 1. 

Similarly, the only terms in ([8| that contribute to E[X<j] are those where A = k, so that each 
Pij appears twice or not at all. Thus 

^{p,j}[Xs] = Y E|^j Ea,f3 tr Yl cTai (T^ J = • perm A 

k'tA \ i J \ i J 

where = E|^j E^,/? ^tr (Taj^ ^^'^I!'^^*^ ■ 



(10) 



A similar result for the Frobenius estimator XprobjS = ||sdetM||^ appears as Theorem 4.3 in Barvi- 
nok |BarOO] . 

We can think of ad as the expectation, over all pairs of permutations a,/3, of the covariance 
between the trace of two products of the same n random matrices, where the products are taken 
in the orders given by a and (3. This expectation clearly stays the same if we assume that a is the 
identity 1 and [3 is uniformly random, so we can write 



ad 



We will evaluate these covariances using a diagrammatic approach. First, suppose we have n 
linear operators cJi, . . . , (T„. The trace of their product is 

Here we save ink by using the Einstein summation convention, in which any index that appears 
twice is automatically summed over. We can think of this product as a particular kind of internal 
trace of the tensor product 
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Figure 1: The trace of the matrix product o"i(T2 • • • is a contraction of the tensor product ai ^ 
• • • (8> Combining this with (13) shows that the covariance between the traces of two permuted 
products is given by d'^~'^ where c is the number of loops in a diagram like that on the right. In 
this case, the covariance between tro"icr20"3 and tro"2(TiO"3 is since n = 3 and c = 1. 



where we contract the index it with J(j+i) mod n for each i. We draw this on the left-hand side of 
Fig. [T| Then if n = 3, say, and /3 is the the transposition (2 3), the covariance 



0'liO"2)0"3 



(trcrifT2Cr3) (tr CJiCr3fT2)* 



becomes a certain contraction of the tensor product of three independent and identical expectations. 



al] (g) [cr2 <S) ^2] ® [0-3 <S) 0-3] . 



(11) 



The following lemma is well-known; we prove it in Appendix [B] for completeness. 
Lemma 4. // a is chosen according to the Haar measure or the Gaussian measure, then 

nk 1 



We can represent (12) diagrammatically as a "cupcap," 



[a ® a*] 



1 



dn' 



(12) 



(13) 



A tensor product such as (11) becomes three cupcaps side by side, and contracting it consists 



of connecting pairs of inputs and outputs until the diagram becomes closed. For instance, the 
expectation of (tr (Ticr20'3) (tr cri(T3cr2)* corresponds to the diagram on the right-hand side of Fig.[l} 
Here we have drawn the cupcaps on the top and bottom of the diagram (between corresponding 
indices of cxj and ctj*) and the connections between them in the interior. 

When we evaluate the trace of this diagram, each of the n cupcaps introduces a factor of 



1/d according to (13), and each loop in the diagram corresponds to an index which can be set 
independently to any value between 1 and d. So, the diagram evaluates to d*^"" where c is the 
number of loops. In this case n = 3 and c = 1, and the covariance is l/d^. 

More generally, we can write the covariance between tr Y\i and tr Yli f/jj as a function of /3 as 
follows. The cupcaps match the upper indices of the as in the first product to those of the second 
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product according to and the lower indices of the second product to those of the first product 
according to I3~^. If r denotes the rotation (12 • • • n), which "weaves" the as together and takes the 
trace of their product, then following the diagram around gives a permutation on (say) the upper n 
indices of the first product (darkened in Fig. [l]) equal to the commutator [P,r] = (3r(3~^r~^. Each 
loop in the diagram corresponds to a cycle in this permutation. So, we have 

arf = ^E;3rf^([^'^" (14) 

where c(7r) denotes the number of cycles in a permutation tt. Note that we always have 

ft 

ad>-,- (15) 
n! 

This follows because, with probability n/n!, a uniformly random (5 is one of the n powers of r. In 
that case [/?, = 1, and d'^d^'^D = d". It can be shown that this bound is tight when d = a;(n^). 

The expectation ( 14 ) can be viewed as the inner product of P„, the uniform distribution over the 
conjugacy class [r] = {vr^^rTr | vr G /S^}, and the function df'^ ^ both of which are class functions — 
invariant under conjugation. Below, we show that these can be expanded in terms of the characters 
of the group Sn and analyzed using the Littlewood-Richardson rule; this yields an exact expression 
for ad'. 

Lemma 5. 

Proof. First, note that the function d'^^^'> is a class function, i.e., one which is invariant under 
conjugation. Therefore, in d'^^l^''"!) we can replace [/?, r] with C,[l3,r\C,~'^ where /3 and (" are 
uniformly random. Since 

we can treat this as the expectation of d^^'^^ where vr is the product of two uniformly random 
elements of [r], the conjugacy class consisting of cycles of length n. In other words, if Pn '■ Sn ^ 
is the uniform distribution on the conjugacy class of n-cycles, then 

TT 

where Pn * Pn is the convolution of Pn with itself, 

{Pn * P„)(vr) = Pniv) Pnir^T^) ■ 

rieSn 



We will view (17) as an inner product over S'„, 



ad = ^(P„*P„,d^(-0 , (18) 
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where the inner product over a group G of two functions /i, /2 : G — > C is defined as 

(/l,/2> = ^^/l(<7)*/2(9). 



\G\ 



To evaluate (18), we will expand P and d^^ -* in the Fourier basis, as a sum of irreducible characters 



of Sn- Recall that the characters of a finite group are orthonormal under the inner product above 
and, additionally, convolution is transformed to pointwise product in the Fourier basis. In short, 
for two characters x and ip, 

X*^=\l^'' , , and (x,V) = <^ .„ , , (19) 

[0 ifx/V', [0 ifx/V'- 

Each character of the symmetric group is associated with a Young diagram, i.e., a partition 



Ai > A2 > • • • where Aj = n. In light of the Murnaghan-Nakayama rule (Lemma 10 of 
Appendix [A|), the uniform distribution Pn over the conjugacy class [r] is supported solely on hooks, 
i.e.. Young diagrams consisting of a single ribbon of size n. Let denote the hook of height 
t + 1, in which Ai = n — t for some < f < n and Aj = 1 for 1 < i < t + 1. Let xt denote the 
corresponding character. Then dimx^ = Xt{^) = ("7^) and, again appealing to Lemma 



Xt(H) = (— 1) . Applying (19) then gives 
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we have 



(i^n,Xt) = ^^ and (P„*P„,xt) = (20) 



t 



To calculate the inner product (^d'^^'\ xt) , consider the following combinatorial representation 
of Sn- Let S be the set of strings of length n over the alphabet {1, . . . ,d}, and let Sn act on S in 
the natural way, by permuting the symbols in a given string. Given a permutation vr, the character 
Xe(^) is the number of strings fixed by vr. Since each of vr's cycles can be given an independent 
label in {1, . . . , d}, we have xt,{'^) = d'^^'^\ 

It follows that (^d^^'\xt) = {xt, XT,), the number of copies of A^ appearing in the decomposition 
of S into irreducible representations. To find this, we first decompose S into a direct sum of 
combinatorial representations consisting of strings where i appears rii times for each 

i G Then {x{ni,...,na)^Xt) is given by a Kostka number, defined as follows. First, sort 

the Hi in decreasing order so that they form a Young diagram A''. Then K^* = {x{ni,...,na)^Xt) is 
the number of semistandard tableaux of shape A^ and content N: that is, the number of ways to 
fill At with rii is for each i £ {1, . . . , d}, where each row is nondecreasing and where each column 
is strictly increasing. 

Since A^ is a hook, to specify a semistandard tableau with a given content it suffices to specify 
the content of the leftmost column. Since this column must be strictly increasing, its t + 1 entries 
must be distinct. If has k rows, i.e., if nj 7^ for k values of i, then the first one must appear 
in the top cell, but the remaining t cells can be chosen arbitrarily. Thus = C^^^)' ^^"^ ^ 
if t > /c. There are (f) (^Zi) partitions (ni, . . . ,nd) with k nonzero rii. Since 1 < A; < mm{d,n), 
summing over k then gives 

-'"^x.)^E0(::O'^t^T(:)(:::)C;O- ^^^^ 
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We can now calculate the inner product d'^^'^^ Combining (20) and (21 ) and summing over 
t, we have 



n-l 



l(Pn,d<-^)=nlY,{P*P,Xt)(d'^\xt 



t=0 
min(d,n) 



A;=l 



d\ v-4 / n — t — 1 



k j \ n — k 

t=o 



n-l mm{d,n) 

E E 

t=0 k=l 
min(d,n) 

= E 



k=l 



d\ /n-l\/k-l 
A:jU-ljv t 



d\fn 
k)\k-l 



where = if (i < n. Combining this with ( 18 ) completes the proof. 



n + d 
n + l 



n — 1 
t 



d 

n + 1/ ' 



□ 



3 The second moment in the unsymmetrized case 

Squaring ([7| — and, for aesthetic reasons, placing the conjugated ps in the second half of the ex- 
pression and changing the names of the permutations — gives 



E ( 

K,\,ii,v\-A 



-1) 



K\flU 



Pi,' 




(22) 



Now we take the expectation over the pij. As before, the only terms of this sum that contribute 
to this expectation are those in which each pij appears an even number of times. Moreover, each 
Pij must appear an equal number of times conjugated (in the first and second products) and 
unconjugated (in the third and fourth products), since Eo-[o" a] = 0. In the Gaussian measure. 



this is because E[((Tp^] 



if crj is chosen from the Gaussian distribution on C. In the Haar 



measure, the same thing is true because the tensor square o" (8> a of the defining representation of 
U((i) contains no copies of the trivial representation. 

For each term of (22), associated with a tuple {k, X, p, v), we express the total number of 
occurrences of each pij with an n x n matrix Cij. In light of the discussion above, for the terms 
that contribute to the second moment we have Cij = if Aij = 0, Cij € {2,4} if Aij = 1, and 
Ei Cij = Cij = 2n. We wih denote these conditions as C h ^. As in jKKL+QSl ICRS03| . we 
think of C as a "double cycle cover" of the bipartite graph described by A. This graph has n 
vertices on either side, and an edge between the ith vertex on the left and the jth vertex on the 
right if and only if Aij = 1. Each vertex has degree 2 or 4 in C. Thus C consists of cycles where 
each edge is covered twice, and possibly some isolated edges which are covered four times. 

We then write the second moment as a sum, over all C, of the quadruples such that (k. A, p, v) h 
(7, where this denotes the following relation: 

(k. A, /X, z^) h C 44> vr h j4 for all vr G {k. A, /i, v} 

and llvr G {k. A} | j = 7ri}| = |{7r G {^u, \ j = vri}| for all i, j G {1, . . . , n} , 
and |{7r G {k. A, p, i'} \ j = vri}| = dj for all i, j G {1, . . . , n} . 

In our discussion below, we will treat each (k, A, p^ u) as a "coloring" of C. Each double edge is 
colored {k,p), (k, z^), {X,p), or (A,z^), indicating some pair Pij,p*j appearing in the first and third 
products, or the first and fourth, and so on. Each cycle in C must alternate between (k, p) and 
(A, i^) or between (k, z^) and {X,p)- The isolated edges in C bear all four colors, indicating that 
some Pij appears in all four products. We observe that for those tuples that contribute to the 
second moment, the parity (—1)"-'*/^'^ is always 1. 
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Lemma 6. // {k, A, jjL,v)'r C for some C , then (— 1)'^-*^^'^ = 1. 

Proof. Observe that (—1)''-'*'^^ = (—1)'^ where vr = k^^ fiX^^u. We claim that the constraints we 
describe above imply that vr = 1. Consider a cycle c of C on the bipartite graph defined by A. We 
can view k,\, ^,1^ as one-to-one mappings from the n vertices on the left side to the n vertices on 
the right. If c alternates between (k, /u) and (A,z^), then restricting to the vertices on the left side 
of c we have k = and A = z^. Similarly, if c alternates between (k, v) and (A,/i), then restricting 
to these vertices gives k = u and X = fi. Finally, for an isolated edge we have k, = \ = ^ = u when 
restricted to its left endpoint. In all cases we have k~^^X~^v = 1. □ 

Thus the second moment of the unsymmetrized estimator can be written 

= E E (trlla- ) (trn-^^.AO (trll/'tMO (trll'^l- ) ' (^S) 

Many terms in this expectation can be evaluated using the same picture we gave for the expec- 
tation. Each pair Pij, p*j creates a cupcap matching a pair of indices in one of the first two products 
with a pair in one of the second two products. However, the isolated edges in C correspond to a 
fourth-order operator Eo-((t (8) o" (8) c* (8) cr*) which we calculate in the following lemma. 



Lemma 7. // a is chosen according to the Gaussian measure, then 

1 

ijinq ~ ^2 



or diagrammatically, 



E.[a0a®a*0a*] = l(^ + )^) . (25) 



If a is chosen according to the Haar measure, then 

where we write A < B if B — A is positive semidefinite. 
Proof. We have 

In the Gaussian measure, i = m, j = n, k = p, and i = q, but i ^ k or j ^ i, this gives 
\a)\ Ifj^l = l/d"^. If i = p, j = q, k = m, and £ = n, but i k oi j £, we get the same result. 



Finally, \i i = k = m = p and j = £ = n = p, we get E|cr* | = 2/d'^. 

In the Haar measure, analogous to Lemma|4]we will calculate the expectation of cr (8 o" (8 a* (8 a* 
by considering tensor powers of the defining representation a of 0{d). The tensor square o" (8 o" 
decomposes into symmetric and antisymmetric subspaces, each of which is irreducible: 

cr (8 cr = T+ © T_ . 

The dimension of t± is d± = {d^ it d)/2. We can write the projection operators onto t± in terms 
of the exchange operator V which reverses the order of the tensor product, and the identity I I : 
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Now writing cr (8> o" cr* (8) fi* = (r+ © r_) (riji ® r*), the expectation over a is the projection 
operator onto the trivial subspaces of r+ Tjjl and r_ r* : 

E,, [a a f7* a*] = 0^+®^+ © H^"®^" . (27) 



Analogous to (13), we have the handsome 

1 



n;+^^^ = (n+ © n+) . ( ) . (n+ © n+) (28) 



and similarly for 11^ . Putting these diagrams together with (27) gives 



E,[a©c7©(T* ©CT*] = (n+©n+) • (^^^J •(n+©n+) + (n^©n_)- (^^^J -(n.^n. 



1 



^ + ^-^(^ + ^))- (29) 



One can check that (29) is the projection operator onto the two-dimensional subspace spanned 
by the images of 

^ and 

that is, the vectors u = ^ ^l^i jih j^h j) and v = ^ j, In general, given two real- valued 

vectors u and v of norm 1, let IIu and IIv denote the projection operators onto the subspaces 
parallel to them, and let IIu^v be the projection operator onto the two-dimensional subspace they 
span. Then 

where we write A ^ B B — A is positive semidefinite. To see this, note that the eigenvectors of 
IIu + IIv are u it v, with eigenvalues A-t = 1 it (u, v), while their eigenvalues with respect to IIu^v 
are 1. In this case, we have (u, v) = 1/d and 

n - — ^ and n - — ^ 



Thus (29) becomes 



1 



completing the proof. □ 

The operator corresponds to the coloring (k, fi), {X, v), in which some pij appears in the 

first and third products, and another p'- appears in the second and fourth. Similarly, the operator 

corresponds to the coloring (k, v), (A, fi), in which pij appears in the first and fourth products 

and p'j^j appears in the second and third. Thus Lemma [t] tells us that, with a multiplicative cost of 
1 -|- 0(1/ d) per isolated edge in the Haar measure, we can replace a given isolated edge in C with 
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an (unordered) pair of edges. This pair can be colored in two ways: with (k, /i) and (A, u), or with 
(k, v) and (A, n). Equivalently, we can "decouple" each quadruple product p* ® p* into the 

sum of two combinations of tensor products, 

p® p® p* ® p* ^ p ® p" ® p'* ® p"* + p' ® p" ® p"* ® p'* , (30) 

where p' and p" are chosen independently. 

Next we explore the set of (k, A, ^, v) corresponding to a given C, or equivalently the set of 
colorings of C. We will call a coloring pure if every edge in C are colored {k-, p) or (A,z^). This 



corresponds to pairing the pijS in the first product in (23) with their conjugates in the third, and 
those in the second product with their conjugates in the fourth — and choosing the first term in (30 ) 
for each pij which appears in all four products. Each cycle in C has two pure colorings, and each 
isolated edge has one. Thus the number of pure colorings of C is 2*^*^) where t{C) is the number 
of cycles in C. A well-known bijection shows that (permA)^ can be written as a sum over cycle 
covers of the bipartite graph defined by A, 

(perm^)2 = ^2*(^), (31) 

or equivalently that (perm^)-^ is the total number of pure colorings. Combining this with ([o]), we 
have 

m? = Y. E 1- (32) 



ChA {k,A,/^,i/)I-C 
pure 



On the other hand, we can associate each coloring with a pure one, say by replacing the color 
(k, u) with (k, p) and (A, p) with (A, v) on each edge. If this converts a tuple of permutations 
(k, a, v') to a tuple (k, A, p, v) corresponding to a pure coloring, we will write {p' , v') h (k. A, p^ v). 



Then, at the risk of some notational overload, we write (23) as a sum over pure colorings 



Pi,Ki 



ChA {K,X,fi,u)\-C ifi' ,u')\-{k,X,ii,u) 
pure 




Now, analogous to jKKL+93l . we bound the critical ratio E[X2]/E[X]2 as the maximum ratio 



between corresponding terms in these two sums, associated with some pure coloring of some cycle 
cover. The worst possible case is when C consists entirely of isolated edges, since in that case we 
can switch the colors on each edge independently, giving 2" colorings for the single pure one. 

We can parametrize these 2" colorings by strings s £ {0, 1}", where Sj = if the coloring of 
the ith edge is pure, and 1 if its colors are switched. This produces diagrams such as those shown 
in Fig. [2| weaving a total of 8n vertices together. As in our calculation of the expectation, the 
corresponding product of traces is dP~'^^ where c is the number of loops in this diagram. 

Both the pure and "completely impure" colorings 0" and 1" — where the ps in the first product 
are all paired with those in the third or fourth respectively, and the those in the second product 
are all paired with those in the fourth or third — have 2n loops. In general, the number of loops 
is 2n minus the number of times s switches back and forth between and 1 when s is arranged 
cyclically. Specifically, there are two loops of length 4 for each i where Sj = mod m ^'^^ ^ cycle 

of length 8 for each i where Sj ^ mod n- 
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Figure 2: Terms corresponding to a given cycle cover C, where the ps in the gray box are conjugated 
and n = 5. Left, a pure coloring, which has 2n loops. Middle, a maximally impure coloring, which 
also has 2n loops. Right, a mixed coloring corresponding to the string s = 00111. A careful 
inspection shows that it has 8 loops: 6 of length 4, and 2 of length 8. 

For each even i with < i < n, there are 2(^) strings which switch back and forth i times. 
Therefore, combined with Lemma [7] we have (for a cycle cover C consisting of n isolated edges) 

E (trll/'^.- ) (trnaA.) i^rllpl^.] (trUp,^,, 





In the Gaussian measure, this expression is exact if we remove the prefactor (1 + 0(l/d))"; but in 
any case, we get a bound (1 + 0{l/d))"' in either measure. Combining this with ( |32[ ) completes the 
first part of the proof of Theorem [TJ 

4 The second moment in the symmetrized case 

Our analysis of the second moment in the symmetrized case proceeds in two steps. We begin, as 
with the unsymmetrized case, by diagrammatically analyzing the relevant traces. The result is a 
sum over double cycle covers weighted by an exponential generating function d'^^'^^ over a subset 
of the symmetric group S2n- We then show that an allied quantity can be analyzed, as in Lemma |5] 
by harmonic analysis on S2n- 

Before stating the main lemmas of this section, we introduce some further notation. As in ( |31[ ), 
t{C) denotes the number of cycles in C. As before, we let r denote the rotation (1, 2, . . . , n) G Sn- 
The expression vr"" = a'^-Ka denotes conjugation, and, for two elements 7r,(T G S„, we let (vr, o") 
denote the element of S2n given by applying tt and a to the first n and last n elements of {1, ... , 2n}, 
respectively. Finally, we let Wk denote the involution (1 n + 1)(2 n + 2) ■ ■ ■ (k n + k) with the 
convention that wq is the identity. We can then write ^[^f] in terms of the following quantity: 

„(2) , , jc((r--i,r-i)(".'3)«;,(r,r)(T'.^)«,fe) 

fc=0 ^ ^ 

Lemma 8. // the pij are drawn according to the Gaussian or Haar measure, 



d 
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We delay the proof of Lemma [8] just long enough for some comforting words regarding the major 
naining obstacle: estimatin, 
can control a larger quantity, 



(2) 

remaining obstacle: estimating a\ . While we do not have a simple, exact expression for a\ , we 



A:=0 ^ ^ 

in which the kih. term of the sum is graced with an extra factor of . With this reweighting we 

can analyze a"^^ in terms of the Fourier expansions of the class function determined by the 
Kostka numbers, and the convolution square of the conjugacy class {(r, r)'^ | a G S'2n}, determined 
by the Murnaghan-Nakayama rule. This results in the following bound. 



Lemma 9. With notation as above, 

1 



( n \ "-d — "-d — 
\n/2) 

and 

1 f2n\f2n + d-l\ , ^(2) , An^ f2n\ f2n + d - I 



^2" V n y V 2n J - - (i2nyj^ J y 2n 

Combining this with Lemmas [s] and |5] completes the proof of Q and ([s]) in Theorem [T| 
We return now to the proofs of these two lemmas. 

Proof of Lemma Squaring ([8| , the second moment of the symmetrized estimator can be written 



C {k,A,/^,i/)I-C \ i 



(33) 



Consider now a term of (33 ) corresponding to a tuple (k. A, /i, v) of the form 



In light of Lemma [7] (cf. (30)), we may "decouple" any four appearances of the same pij, resulting 
in a sum of terms in which no p appears more than twice. For this reason, we begin our analysis 
with the extra assumption that each pij appears exactly twice. For notational convenience, let us 



temporarily refer to the 2n distinct pij appearing in (34) simply by pi,p2, • • • ,/02n, this list in the 



natural order given by n and A (e.g., pi = pi^^i and pn+i = Pi,\i for i < n). For a tuple {a, P,-j,6), 



then, the cupcaps of Eq. ( 13 ) introduce edges between conjugate appearances of the same pi as 
shown in Figure 3(a) : any two indices attached by an edge are constrained to be equal. 

With this convention, the permutations p and determine a permutation w £ S2n given by 
the ordering of the conjugate appearances of the pi (when a = (5 = ^ = 5 = 1). The contraction 
determined by w and a particular (a, (3, 7, 5) is combinatorial in the sense that it merely constrains 
families of indices (among the {pi\\ and their conjugates) to be equal. Recalling that each cupcap 
contributes a factor oil/d and each cycle permits d different settings of the indices it contains, the 
value of this contraction is determined by the cycle structure of the permutation 

(r-Sr-i)(°"''^"') w-^ (r,r)(^'^) w; 
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w 




Conjugated 



(a) Cupcaps and rotations 



O O O O O O O O O O O O O O O O 

a P 15 

<mp 

d^X d^X d^X 

a (3 15 

o o o o o o o o o o o o o o o o 



(b) Symmetrization induces conjugation 



Figure 3: Contractions in the second moment computation 



see Figure [3(b)[ In particular, we may write the quantity of (34) as 

■ IE„ fl.^ A d 



— Itr ,c((r-i,r-i)('^''3)«)-l(r,r)(T''^)«)) 



where, as before, c(7r) denotes the number of cycles in the permutation vr. 

As we are interested in the expectation, over all rearrangements determined by a, /3, 7, and 5, 
the only relevant feature of the permutation w is 



W 



({l,...,n}n{n + l,...,2n}) = {(i, At(f))} n {(i, KO)} 



(35) 



the number of Cj carried from the "K-block" to the "z/-block." Defining Wk = (1 n + 1) • • • (A: n + fc). 



we may rewrite the expectation of ( 34 ) as 



— V a ,^c(('--\r-i)("'««>fe(r,r)(''.«)«;fc) 

As in Section [sj for a given double cycle cover C, a coloring (k, A, /i, z^) h C is determined by 
selecting, for each nontrivial cycle c of C, whether c's colors alternate between (k, /i) and (A, u) or 
(k, v) and (A, and the parity of this coloring. In light of the decoupling equation ( pO] ), we may 
treat each isolated edge as an "unordered pair" of edges that can be colored in two possible ways, 
with (k,//) and (A,i/) or {n,v) and (A,//). Recall that in the case of Haar measure, this introduces 
a factor 1 + 0{\/d) for each isolated edge, giving the factor (1 + 0{\/d))'^. 



Observe now that the value of k determined in Eq. ( 35 ) is unaffected by the choice of parity in a 



nontrivial cycle. The other choices described above (determining the colors involved in a nontrivial 
cycle or isolated edge) have the effect of exchanging a family of pij in the //-block with a family in 
the v-h\ock. 

In particular, focusing on the portion of the second moment corresponding to a particular double 
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cycle cover C, we have 

(k,A,/^,i/)|-C \ i / \ i / \ i / \ i / 

{K,X,fl,l/)\-C' 

< ^ + O y Q ^c({r-,.-)(-«.,(.,.)(-^)-.) (37) 

l + oQ))%*(^)af . (38) 



Summing over all cycle covers C h A and applying ( |3ip completes the proof. For the Gaussian 
measure, the same proof applies without the factor (1 + 0(1/^))". □ 

We return to the proof of Lemma |9| 

Proof of Lemma [5[ The inequality 

1 ~(2) . ^(2) . -(2) 



/2 



(n/2) 



is immediate from the fact that the terms of the sums defining these quantities are positive. We 
introduce some further notation: for a permutation tt G 6*271, we define 

TT^ = I i G {1, . . . , n}, vri G {n+1, . . . , 2n}} and vr^^ = |i | z G {^^+l, • • • , 2n}, vr? G {1, . . . , n}| . 

Then Itt^^I = |7r-''| and, if vr is selected uniformly in S2n, Pr[|7r1^| = A:] = (^)^/(^^)- Observe also 
that if a, (3, 7, and 6 are chosen uniformly from 5^, the element {'j,6)wk{a,f3) is uniform in the 
set {vr I Ivrl^l = k}. Recalling that d'^^''^ is a class function, 

J— . ^(2) _ ^ V /^"^V— F « . J<(r-l,r-l)(-.««,fc(r,r)(^>*)«;,) .oq^ 



V/^^V— F . Jc((r-i,r-i)(a,/3)-i«;fc(7,5)-l(r,r)(7,5)«;;,(a,/3)) 



1 / n\ ^ 1 

72n 

\ n J k 

— E W<('^"'.'^"')('^'0") - JL]E E jc({r,r)-{r,r)-) 



where vr and cr are chosen uniformly at random from S2n- Here we use the fact that any element 
of 5*271 — in this case, (r, r) — is in the same conjugacy class as its inverse. 
Defining P^.n to be the uniform distribution on the conjugacy class 

[{r,r)]={{r,rr \ n e S2n} C S2n , 

we may express the quantity above as an inner product 



1 



2n\ ^2n 



(2) 



\-{2n)\{d^^-\Pn,n*Pn,n)- (40) 
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As in the proof of Lemma [5] we compute this inner product by determining the Fourier expan- 
sions of the class functions (F^ ' and Pn,n- By the Murnaghan-Nakayama rule, xaC'^j'^) = unless 
the tableau A can be expressed as the union of two n-ribbon tiles. Any such tableau has rank (the 
number of cells on the diagonal) no more than two and can be conveniently expressed in terms of 
its characteristics: defining aj and hi to be the number of cells below and to the right of the ith 
box of the diagonal, respectively, we use the notation r = (5i, 62, • • • > I ai, 02, . . . , 6r) to describe 
the tableau (see Figure E|. If Xrin^n) is nonzero, so that r can be written as the union of two 



1 



Figure 4: A Young tableau decomposed into two n-ribbon tiles, 
n-ribbons, we find (again appealing to the Murnaghan-Nakayama rule) that either 

• r = (61,62 I 01)02) has rank two, oi -|- 62 -|- 1 = 02 + 61 -|- 1 = n, and Xrin^n) = ±2, or 

• r = (61 I ai) has rank one and Xr{n,n) = ±1. 

We let Tn denote the family of representations of S2n described above; note that \Tn\ < ri^. Observe 
that for each r G T„, {Pn,n,Xr) = j^^Xrin^n) (where Xr{n,n) G {ibl,ib2}). Recalling that 



X*X 



\G\ 

x(i 



-X 



for any irreducible character x of a group G, we may express 

1 Xr(ra,n)2 



{Pn,n * Pn.m Xt) 



(2n)! dimr 



As discussed in the proof of Lemma [5] 



d'^\Xr) = {xi:,Xr) = Yl ^P' 



(PlviPd) 

where xs is the permutation representation given by the action of S2n on the set {(ai, . . . ,a2n \ 
tti £ {1, . . . , d}} and Kp is the Kostka number, equal to the number of semistandard tableaux of 
shape r with pi appearances of the number i. Then 



(41) 



(Plv.Pd) 

J]p,:=2n 
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Note that for each r G T„, Xrinju)"^ < 4 and < dimr, as dimr is the number of semistandard 
tableaux of shape r with distinct entries in any totally ordered set. Thus, 

(.) ^^/2n + d-l\ ^ /2n + d-l\ ^/2n + d-l 



On the other hand, each term in the sum of (41) is positive; thus 



{Pn,n*Pn,n,d<-'^) > {Pn,n * Pn,n, Xl) {d'^'K Xl) = ^ (^"^ ^ ' ^^^^ 



We conclude that 

2n\\ 2n - ^ ' ' ' ' - 2nl \ 2n 



which, in conjunction with (40), completes the proof of Lemma [9j □ 

Now we apply these Lemmas to prove an upper bound on the critical ratio E[X^]/E[Xs]^. If d 
is constant, which is the only case for which we have an efficient algorithm to compute Xg |BarOO] . 
our bound is not very inspiring. If n > d, combining Lemmas |5] [Sj and [9] gives 



E[X^]2 - V"/V 2n ) I \n^\) ^ ' ' \ n + d J / \ d 

assuming that d = 0(1). This proves Q in Theorem [T| and suggests that d needs to grow with n 
to give a good estimator. 

On the other hand, when d grows fast enough with n, we find that the critical ratio behaves 
quite well. Combining Lemma [9] with the lower bound (15) gives 

< 4n!2 /2n\ f2n + d-l\^J_ (2n + d-l)! ^J..l\ 2n - l \ 4„2/, 

E[X,]2 - d2« V ™ y V 2n ) d2n {d-l)\ \ d) V d )- 

completing the proof of ^ in Theorem [TJ 

In the critical case where d = 0(1) the upper bound of establish above is tight 

up to the factor introduced by our "approximation" of a^^^ by CL'f' — that is, a factor of (^"2)- 
particular, even for the identity matrix, we can establish a 2"n^'^"'"*^'^^) lower bound on the critical 
ratio: 

Theorem (Restatement of Theorem |2]). Let A he the nxn identity matrix and d a constant. Then 



E[Xs]^ \n'^) E[X^]2 V \d) J \n'^, 

when the pij are distributed according to the Gaussian or Haar measure respectively. 

Proof of Theorem Let d be a constant and A the nxn identity matrix. Then perm A 
perm^ A = 1 and, from Lemma [sj 
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As for the second moment, the only nontrivial term in the sum (33) corresponds to the case where 
the permutations k. A, fi, and v are the identity. In this case each pij appears four times and there 
are precisely (^) terms of (36) for which Wk,x = Wk] in particular, in this case the inequality of (37) 
is an equality. Recalling Lemma [7j we conclude that 



E[X'',] = af and E[X,"] > (1 - 0{l/d)T 
when the pij have Gaussian measure and Haar measure, respectively. For constant d we have 



.(2) 



x(2) 



and, considering that (^L) 



> 



and 



a 



(2) 



f2n\ f2n+d-l\ 
\n)\ 2n I 



in/2) 



1/2; 
f2n+d-l 



I n \ (n+d\ '■ 
\n/2) \n+l) 

^ > /n+d\ 
2n J — 



.(2) 



The statement of the theorem follows. 



2" 



□ 



5 Estimators based on the Frobenius norm 

In this section, we prove Theorem [3] by relating the moments of Frobenius estimators, Xprob = 

1 1 2 1 1 2 — 

||detM|| and Xpi-ob.s = ||sdetM|| , to those of the trace-squared estimators we studied above. 

As Fig. |5] shows, the diagrams corresponding to the expectations and second moments of these 
estimators differ from those of their counterparts by a small number of local moves. Let Q be the 
product of some sequence of pij. Then all we have to do is change our previous contraction, 

\trQ\^ = QiQi 

where the "output" of each Q is connected to its "input," to 

\\Qf = trQQ^ = Q}{Q^y,=Q}{Q*)]. 

In this contraction, we connect the output of each Q to the output of the corresponding Q* , and 
similarly wire their inputs together. The cupcaps, resulting from taking the expectation of p 
for each pij appearing in these products, remain the same as before. 

Now recall that the expectation and second moment of these estimators is proportional to d'^, 
where c is the number of loops in these diagrams. Each of these rewiring moves changes the number 
of loops by at most one, by cutting one loop into two or merging two loops into one. Thus we have 

i E[X] < E[XFrob] < d E[X] and 1 E[X^] < E[Xl^^] < d^ E[X^] , 

and similarly in the symmetrized case. Assuming the worst regarding these bounds yields ([6|, and 
completes the proof of Theorem [T] 
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Figure 5: Rewiring the diagram to change |trM|^ to trMM^ = ||Mf . The cup caps remain 
unchanged, but instead of wiring the "input" of each product to its "output," we wire a pair of 
products together "input" to "input" and "output" to "output." 



6 Conclusions 

As we stated in the Introduction, our results present us with the following irony. For the estimators 
based on the unsymmetrized determinant, which we do not know how to compute efficiently, the 
critical ratio E[X^]/E[X]^ becomes more mildly exponential as d increases. Specifically, for any 
e > we can make the critical ratio 0((1 + e)") by taking d = 1/e. 

On the other hand, for the estimators based on the symmetrized determinant, the critical ratio is 
r2(2") in the case d = 0(1) where we have an efficient algorithm. In order to reduce this exponential 
to 0(c") for some c < 2, we need d to be a growing function of n. This is contrary to the intuition 
expressed in |BarOOj . and to our own initial intuition when we began work on this problem. 

Of course, the symmetrized estimators may still be tightly concentrated, as conjectured 
in |BarOO] . However, since their variance is large, any proof of concentration would have to bound, 
implicitly or explicitly, their higher moments. 

At this point, finding an algebraic polynomial-time approximation scheme for the permanent 
seems to require progress on at least one of several fronts. One approach would be to seek a 
polynomial-time algorithm for sdet M in the case where M's entries belong to where d = poly(n), 
but it seems difficult to scale up the algorithm of |BarOO| beyond d = 0{1). Another approach, as 
suggested in [CRS03] , would be to seek an algorithm for det M where M's entries belong to some 
group with representations of arbitrarily high dimension. However, it seems difficult to construct 
a succinct description for the group algebra elements which appear in the determinant, since their 
support in the group basis is exponentially large. 
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A Representation theory and the symmetric group 

We briefly discuss the elements of the representation theory of groups, and of the symmetric groups 
in particular. Our treatment is primarily for the purposes of setting down notation; we refer the 
reader to |JK81] for a complete account. 

Let G be a finite group. A representation /) of G is a homomorphism p : G ^ U(y), where V is 
a finite-dimensional Hilbert space and U(y) is the group of unitary operators on V . The dimension 
of p, denoted dp, is the dimension of the vector space V . By choosing a basis for V , then, we can 
identify each p{g) with a unitary dp x dp matrix; these matrices then satisfy p{gh) = p{g) ■ p{h) for 
every g,h G G. 

Fixing a representation p : G ^ U(y), we say that a subspace W <ZV \s invariant if p{g)W C W 
for all g G G. We say p is irreducible if it has no invariant subspaces other than the trivial space 
{0} and V . If two representations p and a are the same up to a unitary change of basis, we say that 
they are equivalent. It is a fact that any finite group G has a finite number of distinct irreducible 
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representations up to equivalence and, for a group G, wc let G denote a set of representations 
containing exactly one from each equivalence class. The irreducible representations of G give rise 
to the Fourier transform. Specifically, for a function f : G ^ C and an element p £ G, define the 
Fourier transform of f at p to be 




geG 

The leading coefficients are chosen to make the transform unitary, so that it preserves inner prod- 
ucts: 

(/i,/2) =Y.fi(9)f2{9) = j;tr(A(p)t . f,{p)) . 

9 peG 

In the case when p is not irreducible, it can be decomposed into a direct sum of irreducible 
representations, each one of which operates on an invariant subspace. We write p = cri • • • (7^ 
and, for the ai appearing at least once in this decomposition, at -< p. In general, a given a can 
appear multiple times, in the sense that p can have an invariant subspace isomorphic to the direct 
sum of Oct copies of a. In this case is called the multiplicity of a in p, and we write p = ©^.^p aacr. 

For a representation p we define its character as the trace Xp(ff) = tr p{g). Given an element m, 
we denote its conjugacy class [m] = {g~^mg \ g G G}. Since the trace is invariant under conjugation, 
characters are constant on the conjugacy classes, and we write = Xp(^) where m is any 

element of [m] . Characters are a powerful tool for reasoning about the decomposition of reducible 
representations. In particular, for p,a E G, we have the orthogonality conditions 

{xp,xa)G = TLj2^p^s)xA9y 

1^1 geG 

If p is reducible, we have xp = Yla^p ^^Xai, and so the multiplicity is given by 

< = (Xp,X<t)g • 

If p is irreducible, Schur's lemma asserts that the only matrices which commute with p(g) for 
all g are the scalars, {cl | c G C}. Therefore, for any A we have 

-^Ep(5)Up(5) = *^1., (43) 
geG 

since conjugating this sum by p{g) simply permutes its terms. 

We specialize now to the case of the symmetric group Sn of permutations of the set {1, . . . , n}. 
The representations of Sn are in one-to-one correspondence with Young diagrams or, equivalently, 
integer partitions A = (Ai,A2,---) where Ai > A2 > • • • and Aj = n. The character of this 
representation is denoted xx- The Murnaghan-Nakayama rule gives a recursive formula for the 
character xx- preparation for stating the rule, we define a ribbon tile of length A; to be a 
polyomino of k cells, arranged in a path where each step is up or to the right. 

Lemma 10 (Murnaghan-Nakayama rule). Given a Young diagram A and a permutation vr with 
cycle structure ki > k2 > ■ ■ ■ , a consistent tiling of A consists of removing a ribbon tile of length ki 
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from the boundary of \, then one of length k2, and so on, with the requirement that the remaining 
part of X is a Young diagram at each step. Let hi denote the height of the ribbon tile corresponding 
to the ith cycle: then 

Xx{^) = Y.Il(-^y''^' (44) 

T i 

where the sum is over all consistent tilings T. 

B Proof of Lemma [4] 

Proof. For the Gaussian measure, this is simply the fact that {a ® a*y^^ = (J*(o"^)*. If i 7^ A; or 

j 7^ then this is the product of two independent random variables both of whom have expectation 

. 2 

zero. If i = A; and j = i, then this is o"'- , whose expectation is 1/d. 



For the Haar measure, (12) follows from a little representation theory. (For a brief introduction 
to representation theory, see Appendix [A}) Abusing notation, suppose that a is the defining rep- 
resentation of the group U(d) of unitary matrices, i.e., the d-dimensional representation in which 
unitary matrices act on column vectors in the natural way. Then o" cr* is isomorphic to the 
conjugation action of \J{d) on GL{d), the vector space of d x d matrices. We can decompose this 
into the direct sum of two invariant subspaces cr^cr* = l©r, where 1 is the trivial representation, 
consisting of the scalar matrices, and F is the (d^ — l)-dimensional representation consisting of d x d 
matrices with zero trace. Both these subspaces are clearly invariant under conjugation, and are, in 
fact, irreducible. Taking the expectation over a E U((i) gives the projection operator IIJ®'^ onto 
the trivial subspace — that is, the linear operator on the space of matrices which takes a matrix 



A = Ajfc and returns a scalar whose trace is tr^. We claim that this operator is exactly (12), since 



Here we again use the Einstein summation convention, so that = tr A, and the identity matrix 
is 1 = bji. □ 
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